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MEMORY SYSTEM HAVING STUB BUS CONFIGURATION 

RELATED APPLICATIONS 

This application claims the benefit of United States Provisional Application Serial No. 
60/267,908, filed February 9, 2001, the contents of which are incorporated herein by reference, in 
their entirety. 

BACKGROUND OF THE INVENTION 

Since the introduction of synchronous dynamic random access memories (SDRAMs) 
wherein the data transfer rate of the memory system is directly dependent on system clock rate, a 
number of advances have been made to further increase the data rate. For example, double data 
rate (DDR) SDRAMs further increase data transfer rates by causing the data 
transmission/reception to occur at both rising and falling edges of the system clock. In order to 
assign a sufficient timing budget in receiving data from a memory controller and a DRAM 
module with increased data transfer rate, the concept of source synchronous clocking, which 
allows routing of a synchronous clock through the same path as that of the data, has been 
introduced. This concept is illustrated in FIGS. 1 and 2. 

FIG. 1 illustrates source synchronous clocking using a bidirectional strobe signal DQS as 
well as the system clock signal CK in a DDR SDRAM. The DQS signals are generated by either 
a memory controller 20 or DRAM devices 22 and are used as timing reference signals for writing 
data to the DRAM modules 22 and for reading data from the DRAM modules 22. The data are 
transferred via the data bus DQ using the DQS strobe signals as a timing reference. The DQS 
strobe signals are routed through the same path as the DQ signals. The memory controller 20, or 
DRAM modules 22, sample the data respectively using the strobe signals DQS transmitted 
through the same path as the data bus, instead of using the system clock CK as a reference. The 
system clock (CK) is used for sampling the command/address signals (Com/ Add) at the DRAM 
22, and DQ signals sampled by the DQS signal are translated to the system clock domain 
internally both at the DRAM 22 and at the controller 20. 
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FIG. 2 illustrates a typical rambus DRAM configuration. In this configuration, a DRAM 
module 26 and a memory controller 24 receive two clocks, namely a Clk-From-Master (CFM) 
signal a Clk-To-Master (CTM) signal. The CFM signal is used as a reference clock for write 
data, and the CTM signal is used as a reference clock for read data. In each case, the data and 
clock signals are routed and transferred through the same data path. 

For the DDR SDRAM approach illustrated in FIG. 1, a preamble timing interval is 
required following a read or write command. The preamble interval includes information 
regarding entry of a valid data strobe signal, and is necessary because the DQS signal can be 
generated by either the memory controller 20 or the DRAM 22. If the source of the DQS signal 
changes, then the preamble period must be initiated to pass the token from one to the other. 
Typically, this preamble interval consumes an entire clock cycle before valid data can be 
transmitted. Since transmission of data is not permitted during the preamble interval, overall 
efficiency of the system bus is adversely affected. In the rambus DRAM approach illustrated in 
FIG. 2, the use of the forward clock CFM and the return clock CTM again results in the crossing 
of two clock domains within a DRAM module 26. The resulting configuration is therefore 
complicated since sophisticated compensation circuitry is required to manage the phase 
difference between the CFM and CTM clock signals, and since an initialization procedure is 
needed for the phase compensation circuitry. In both DDR SDRAM and rambus DRAM 
architectures, the pin count for the memory module connector is increased due to the increased 
number of clock pins, and in the case of rambus, the data pin count is doubled. 

SUMMARY OF THE INVENTION 

The present invention is directed to a memory system having a stub bus configuration that 
transmits a free-running clock through the same path as data signals and a method for 
transferring data in such a system. A single clock domain is employed for both read and write 
operations. For both operations, the read or write clock signal is routed through the same 
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transmission path as the data, thereby increasing system transfer rates by maximizing the window 
of data validity. In this manner, data bus utilization is increased due to the elimination of a need 
for a preamble interval for the strobe signal, and pin count on the memory module connectors is 
therefore reduced. 

In one embodiment, the present invention is directed to a memory system having a stub 
configuration and a method for transferring data in such a system. The system includes a 
controller for generating a first clock signal, a control signal, an address signal, and data signals 
on a data bus, the data bus, first clock signal, control signal, and address signal being arranged in 
a stub configuration. A memory module includes a memory device coupled to the controller, the 
memory module receiving the first clock signal and the control signal that includes a read or 
write command. The memory module, in response to the write command, initiates a write 
operation for writing the data signals from the data bus to the memory in synchronization with 
the first clock signal. The memory module, in response to the read command, initiates a read 
operation for reading data from the memory to the data bus in synchronization with the first clock 
signal, and generates a second clock signal in response to the first clock signal, the second clock 
signal being provided to the controller, the controller receiving the data signals on the data bus in 
response to the second clock signal during the read operation. 

In a preferred embodiment, the first clock signal comprises a write clock and the second 
clock signal comprises a read clock. The system preferably comprises multiple memory modules 
wherein the multiple memory modules each generate independent second clock signals, the 
second clock signals each being different in phase. The phases of the multiple second clock 
signals are different in phase due to the difference in propagation delay between each of the 
memory modules and the controller. The propagation delay of the second clock signal from the 
memory module to the controller is substantially equal to that of the data bus. 

The memory module may further include a control buffer that receives the first clock 
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signal and the control signal and generates the second clock signal in response to the first clock 
signal. A phase locked loop or delay locked loop may be provided for generating the second 
clock signal in response to the first clock signal, within the control buffer, or independent of the 
control buffer. Alternatively, a return path may be coupled directly to a first clock signal line that 
receives the first clock signal for generating the second clock signal in response to the first clock 
signal. 

A capacitor may be included having a capacitance that is selected to compensate for 
capacitive loading on the data bus by the memory device of the memory module; the capacitor 
being coupled to a junction of the first clock signal line and the second clock signal line. 

The memory system may include first and second memory modules, the memory module 
generating respective first and second independent return clock signals as the second clock 
signal, and further comprising a motherboard coupling the first and second memory modules and 
the controller, the motherboard including the data bus, a control bus for transfer of the control 
signals, an address bus for transfer of the address signals, a first clock signal line for transfer of 
the first clock signal and first and second independent return clock signal lines for transfer of the 
first and second return clock signals. The first and second return clock signal lines are crossed 
on the motherboard between the first and second modules so as to ensure similar pinout 
configurations at the connectors of the memory modules. The first return clock signal line is 
preferably coupled to a dummy load on the second memory module and the second return clock 
signal line is coupled to a dummy load on the first memory module. The dummy load preferably 
comprises a load capacitor or a dummy pin and is selected to match the capacitance loading of 
the data bus. 

A first flag signal may be generated by the memory controller, the memory module, in 
response to the first flag signal, controlling timing of initiation of the write operation or read 
operation. If a read operation is commanded, the memory module generates a second flag signal 
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in response to the first flag signal, the second flag signal being provided to the controller, the 
controller receiving the data signals on the data bus in response to the second clock signal and the 
second flag signal during a read operation. 

The memory system preferably includes first and second memory modules, the memory 
module generating respective first and second independent second flag signals, and further 
comprising a motherboard coupling the first and second memory modules and the controller, the 
motherboard including the data bus, a control bus for transfer of the control signals, an address 
bus for transfer of the address signals, a first flag signal line for transfer of the first flag signal, 
and first and second independent return flag signal lines for transfer of the first and second return 
flag signals, the first flag signal line and the first and second return flag signal lines being routed 
with the control bus and the address bus. The first and second return flag signal lines may be 
crossed on the motherboard between the first and second modules. The first return flag signal 
line may be coupled to a dummy load on the second memory module and the second return flag 
signal line may be coupled to a dummy load on the first memory module. The dummy load 
preferably comprises a load capacitor or a dummy pin that is selected to match the capacitance 
loading of the data bus. 

The control buffer may be mounted to a first side of the memory module, and a dummy 
load couples to a first signal line of the control buffer to provide load matching with a load 
experienced by a second signal line of the memory devices mounted to both first and second 
sides of the memory module. The dummy load may comprise, for example, a load capacitor or a 
dummy pin. The first signal line may comprise the first clock signal or the second clock signal, 
and the second signal line may comprise the data bus or the first clock signal. 

In another embodiment, the present invention is directed to a memory system having a 
stub configuration and a method for transferring data in such a system. The system includes a 
controller for generating a first clock signal, a control/address signal, and data signals on a data 
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bus, the data bus, first clock signal, and control/address signal being arranged in a stub 
configuration. A second clock signal generator generates a second clock signal. A memory 
module includes memory coupled to the controller, the memory module receiving the first clock 
signal, the second clock signal and the control/address signal that includes a read or write 
command. The first clock signal propagates from the controller to the memory module in a first 
direction of propagation, and the second clock signal propagates from the memory module to the 
controller in a second direction of propagation. The memory module, in response to the write 
command, initiates a write operation for writing the data signals from the data bus to the memory 
based on the first clock signal. The memory module, in response to the read command, initiates 
a read operation for reading data from the memory to the data bus in response to the second clock 
signal, the controller receiving the data signals on the data bus in response to the second clock 
signal during the read operation. The memory controller preferably compensates for phase 
difference between the received second clock signal and the data signals on the data bus. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The foregoing and other objects, features and advantages of the invention will be apparent 
from the more particular description of preferred embodiments of the invention, as illustrated in 
the accompanying drawings in which like reference characters refer to the same parts throughout 
the different views. The drawings are not necessarily to scale, emphasis instead being placed 
upon illustrating the principles of the invention. 

FIG. 1 is a schematic block diagram of a conventional double data rate (DDR) 
synchronous dynamic random access memory (SDRAM) employing a bidirectional strobe signal 
DQS and a system clock signal CK. 

FIG.2 is a block diagram of a conventional rambus DRAM configuration employing a 
forward clock CLK_FROM_MASTER and a return clock CLK_TO_MASTER. 

FIG. 3 is a schematic block diagram of a memory system according to the present 
invention, wherein the C/A buffer generates a read clock signal RCLK based on the write clock 
signal WCLK. 
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FIG. 4 is a schematic block diagram of an alternative embodiment of the present 
invention wherein the memory modules include a return path for returning the write clock signal 
WCLK as the read clock signal RCLK. 

FIG. 5 is a schematic block diagram of a second alternative embodiment of the present 
invention wherein the memory modules include a phase locked loop (PLL) for generating the 
RCLK signal in synchronization with the WCLK signal. 

FIG. 6 is a schematic block diagram of a third alternative embodiment of the present 
invention wherein the memory modules further generate an RFLAG signal in synchronization 
with a received DFLAG signal. 

FIG. 7 illustrates a command/address (C/A) buffer including dummy pins in accordance 
with the present invention. 

FIG. 8 illustrates a C/A buffer including dummy capacitors in accordance with the present 
invention. 

FIG. 9 is a schematic block diagram of a memory system in accordance with the present 
invention wherein the RCLK signal is generated remotely and propagates along the system bus in 
a direction opposite that of the write clock WCLK signal. 

FIG. 10 illustrates phase delay generated between the RCLK and read data DQ during a 
read operation in the embodiment of FIG. 9 due to the stub configuration, in accordance with the 
present invention. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

FIG. 3 is a schematic block diagram of a memory system having two memory modules 
configured in accordance with the present invention. The system includes a controller 40, first 
and second memory modules 42A, 42B respectively, a system bus 50 interconnecting the 
memory modules 42A, 42B and the controller 40, and termination resistors 48 for terminating the 
system bus 50. The memory modules 42A, 42B are configured in a stub arrangement, meaning 
that the signals pass to and from the memory modules 42 from the system bus 50 as a stub, or 
branch extension, of the system bus 50, as opposed to a point-to-point or serial configuration.. In 
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this configuration, all clock, data, and command/address signal lines reaching the memory 
modules are stubs of the system bus. Each memory module 42A, 42B includes a plurality of 
memory devices 52, for example DRAM units DRAMl-DRAMn and a command/address buffer 
46A, 46B. The memory devices 52 may extend along the module body to the left and right of the 
C/A buffer 46A, 46B as shown, however, for the purpose of illustrating the present invention, 
only the left half of the module is illustrated in FIG. 3. The C/A buffers 46A, 46B receive 
command and/or address (C/A) signals in synchronization with the write clock signal WCLK. 
L , The received C/A signals are in turn provided to the DRAMs 52 via C/A signal line 44A, 44B, 
9 again in synchronization with the WCLK signal. The use of a C/A buffer 46A, 46B on each 
«p memory module 42A, 42B alleviates loading of the command/address bus C/A on the system 
K motherboard. 

si 

^ The system bus 50 further includes a number of data bus signals DQl-DQn that 

correspond to memory devices 52 DRAMl-DRAMn on the memory modules 42 A, 42B. Each of 
•=? data bus signals DQl-DQn is in turn comprised of multiple data signal lines, for example, 4, 8, or 
C3 16 lines. The write clock signal WCLK is buffered and distributed at the memory controller 40 

IU 

and routed along with corresponding data bus signals DQl-DQn, as well as the 
command/address C/A signal, so as to ensure that the data bus DQ signals and the C/A signals 
arrive at the memory modules 42A, 42B at the same time as the corresponding WCLK signal. 

In this manner, in the case of a write operation where it is intended to write data from the 
memory controller 40 to a memory device 52, the memory controller 40 outputs the write data 
DQ in synchronization with the write clock signal WCLK, and thus transmits the write data DQ 
to the memory devices 52 on the appropriate memory module 42 A, 42B. Since the data bus DQ 
and the write clock line WCLK have the same stub configuration, the write data on the DQ bus 
and the write clock signal on the write clock line WCLK are input at the memory devices 52 
subject to an equal propagation delay time. In this manner, assuming a given memory device 52 
samples the write data DQ using the write clock signal WCLK, a maximum valid data window is 
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achieved. 

On the other hand, in the read operation, that is the case of the transfer of data from the 
memory modules 42A, 42B to the memory controller 40, a read clock (RCLK) signal is output 
from the C/A buffer 46A, 46B, and is transferred to the memory controller 40 along with the data 
read from the memory device 52. When the memory device 52 in a memory module 42A outputs 
the read data DQ in synchronization with the WCLK signal, the C/A buffer 46 also outputs the 
read clock signal RCLK at the same time. By synchronizing the read clock RCLK signal with 
the received write clock WC?LK signal, the read data DQ and read clock RCLK signals arrive at 
the memory controller 40 with an equal propagation delay because they have the same stub bus 
path length along the system bus 50 and along the stub path between the memory module 42 and 
the system bus 50. The configuration of FIG. 3 thus alleviates a phase differential problem that 
would otherwise occur in transferring data that is read from the memory devices 52 to the 
memory controller 40. Assume, for a moment, that the read clock signal RCLK, rather than 
being synchronized with the received write clock signal WCLK, is instead sourced at the memory 
controller 40 and distributed to the memory modules 42 in a stub bus configuration. In this case, 
the direction of the transmission of the read clock RCLK would have an opposite direction to that 
of the read data DQ and therefore would have an entirely different phase relationship with the 
read data DQ, Assume also a different case where the read clock RCLK is again distributed in a 
stub bus configuration to all memory modules 42 and to the memory controller 40, but is sourced 
at a location that is on the system bus 50 beyond the outermost memory module 42B. In this 
configuration, discussed with reference to FIGs. 9 and 10 below, the resulting timing relationship 
would be improved since the read data DQ and the read clock RCLK would be propagating in the 
same direction. However, the read data DQ and the read clock RCLK as received at the 
controller 40 would have a timing difference on the order of two times the stub delay, since in 
order for the data to be activated on the data bus DQ, the read clock signal RCLK is first received 
by the memory device 52 via one pass of the stub and the read data DQ transmitted through a 
second pass of the stub to the system bus 50 and to the memory controller 50. Thus, the read 
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data DQ received at the memory controller 40 is delayed, with respect to the RCLK signal, by a 
time period corresponding to two times the propagation delay of the stub path of the module 
from which the read data DQ is sourced. Therefore a longer setup time period is necessary 
before the read data DQ can be sampled by the controller. This is an inherent problem of the stub 
bus architecture. 

The problem identified above is alleviated by the memory system configured in 
accordance with the present invention. In this configuration, read data DQ is output from a 
memory device 52 located on any of the memory modules 42 A, 42B in synchronization with the 
received write clock signal WCLK. For example, assuming that data is to be read from a 
memory device 52 on the first memory module 42A, a first read clock signal RCLK_M1 is 
output from the C/A buffer 46A in synchronization with the received write clock signal WCLK 
and transferred to the memory controller 40 via a path having the same path length as that of the 
read data DQ. Because the read clock signal RCLK_M1 and read data DQ propagate along the 
same stub bus path, they arrive at the memory controller 40 with an equal delay time and 
therefore, the read data DQ can be sampled using the read clock signal RCLKJMl immediately, 
without the need for phase compensation or a preamble interval as required by the conventional 
strobe approaches. 

The same scenario applies in the case where data is to be read from a memory device 52 
on the second memory module 42B. A read clock signal RCLK _M2 is generated in response to 
the write clock signal WCLK and is synchronized therewith. The read clock signal RCLK_M2 is 
transmitted via the system bus 50 along with the read data DQ, and is received at the memory 
controller 40 as a second read clock signal RCLK_M2 that is separate from and independent of 
the first read clock signal RCLK Ml. Separate read clock signals RCLK_M1, RCLK M2, are 
necessary because the phases of the read clock signals RCLK from the plurality of memory 
modules 42A, 42B are different due to the difference in signal path length between each 
respective module 42A, 42B and the memory controller 40. In order to permit the memory 
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modules 42A, 42B to have the same pin configurations, an advantageous feature for mass 
manufacturing of the modules, the read clock signal lines RCLKJVll, RCLK_M2 are crossed on 
the motherboard hosting the system bus 50, as shown at location 53. 

In order to further improve performance, capacitive loading on the DQ bus and RCLK 
bus should be equal for all memory modules 42A, 42B. To achieve this, the RCLK_M1 signal 
may be connected to a dummy load capacitor or a dummy pin 54B of the second memory module 
42B. Likewise, the RCLKJM2 signal is connected to a dummy load capacitor or a dummy pin 
54A of the first memory module 42A. In configuration employing more than two memory 
modules 42A, 42B, the read clock signals RCLKJVIn of the other modules in the system would 
be tied to dummy load capacitors or dummy pins of a given module. If a capacitor is used, the 
capacitance value should be chosen to equal the input capacitance of the RCLK pin. If a dummy 
pin is used, the design of the C/A buffer would ensure the input capacitance of the dummy pin to 
be equal to that of the RCLK pin. 

In the above example of FIG. 3, the C/A buffer 42A, 42B preferably includes a phase 
locked loop (PLL) or delay locked loop (DLL) that serves to generate a read clock signal RCLK 
that is synchronized with the received write clock signal WCLK; namely, the transition edges of 
the RCLK signal are aligned with those of the WCLK signal. In the phase locked loop (PLL), the 
phase of the voltage controlled oscillator is controlled until the output clock edge is aligned to 
that of the WCLK. In the delay locked loop (DLL) the WCLK is applied to a variable delay line 
the delay of which is controlled until the output clock edge is aligned with that of the WCLK. 

FIG. 4 is a schematic block diagram of an alternative embodiment of the present 
invention for the case where the C/A buffer 46A, 46B on the memory module 42A, 42B is 
incapable of generating the RCLK signal in synchronization with the WCLK signal, for example 
in the case where the C/A buffer does not include a PLL or DLL. In this embodiment, the 
WCLK signal is made to drive a dummy capacitor 60A, 60B on each module 42A, 42B, the 
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dummy capacitor 60A, 60B having a capacitance value that emulates the capacitance of the 
memory device 52 from which data is being read. The capacitance experienced by the WCLK 
signal should be equal to that experienced by the DQ signal, as the flight time from the memory 
controller to the DRAM should be equal for both DQ and WCLK. The line of the respective read 
clock signal RCLKMl, RCLK_M2 from the corresponding module is coupled directly to the 
received WCLK signal line. In this manner, the read clock signal RCLK is generated by 
returning the write clock signal WCLK received at each module 42A, 42B and is therefore 
synchronized therewith. Therefore, upon the memory controller 40 initiating a read activity from 
a memory module 42, the read data DQ is received by the memory controller 40 in 
synchronization with the read clock RCLK of the corresponding module. As explained above, it 
is preferable that each module 42 A, 42B have an associated read clock signal RCLKJMl, 
RCLKJV12 that is independent of the other modules signals. In addition, the read clock signal 
paths RCLK_M1, RCLK_M2 are crossed at location 53 on the motherboard hosting the system 
bus 50 to ensure a similar pin configuration for all modules. As described above, additional 
dummy load capacitors 54A, 54B are tied to the read clock signals that are not utilized by a given 
module in order to provide for equalization with the loading of the data bus DQ. 

FIG. 5 is a schematic block diagram of an alternative embodiment of the present 
invention applicable, for example, to memory modules 42A, 42B that do not include a C/A 
buffer 46. In this configuration, the C/A signal bus is distributed from the memory controller 40 
via the system bus 50 to the memory devices 52 on the memory modules 42A, 42B via signal 
paths 44A, 44B. A phase locked loop (PLL) 62A, 62B, or alternatively a delay locked loop 
(DLL), is provided on each module 42A, 42B for receiving the write clock WCLK signal and for 
generating a respective read clock RCLK signal for the module 42A, 42B, Phase locked loops 
and delay locked loops are well-known mechanisms for ensuring that an output signal is 
generated so that the transition edges of the output signal are aligned to that of an input signal. 
The operation of the phase locked loop of FIG. 5 is the same as that of the phase locked loop 
included in the C/A buffer of 46 of FIG. 3. As described above, dummy load capacitors or 
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dummy pins 54A, 54B can be provided for read clock signals RCLK not associated with a given 
module to provide for equalization with the loading of the data bus DQ. 

In traditional memory systems, it is common for the data bus DQ to operate at a rate that 
is two times faster than the command/address C/A bus. For this reason, commands are provided 
to the memory modules in advance of the data so that the memory devices on the module have 
sufficient time to prepare for the data read or write operation. The latency between the command 
and data signals is commonly referred to as column address strobe (CAS) latency. With 
reference to FIG. 6, a DFLAG signal, generated by the memory controller 40, provides the CAS 
latency information for both read and write operations to the modules 42A, 42B. As shown in 
FIG. 6, the C/A buffer 46A, 46B receives the DFLAG signal from the memory controller 40 and 
outputs the DFLAG signal to each memory device 52 on the module 42A, 42B via the buffered 
C/A signal 44A, 44B. Upon sensing a transition in the DFLAG signal, each memory device 52 
outputs read or write data on the data bus DQ following a predetermined time interval. The 
DFLAG signal is received in synchronization with the write clock WCLK signal and the data bus 
signals DQ by virtue of the DFLAG signal being configured in the same stub bus structure. For 
this reason, the DFLAG signal will experience the same propagation delay as the WCLK and 
data bus DQ signals in the direction of propagation between the memory controller 40 and the 
memory modules 42 A, 42B. 

The C/A buffer 46 may optionally generate a flag signal for the return path RFLAG in 
synchronization with the received DFLAG signal in the same manner that the read clock RCLK 
signal is generated in synchronization with the write clock WCLK signal. For example, a delay 
locked loop or phase locked loop may be employed within the C/A buffer 46 to generate the 
return flag signals RFLAG Ml, RFLAG_M2 based on the received DFLAG signal. The 
RFLAG signal carries timing information related to when read data DQ output from the memory 
devices 52 will arrive at the memory controller 40. While the memory controller 40 can receive 
valid data transferred from the memory devices 52 on the modules 42A, 42B in synchronization 
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with the read clock signal RCLKM1, RCLK_M2 transferred from the memory module, it is 
possible for the memory controller to receive invalid data from the memory devices 52 should 
the time difference between the WCLK and RCLK signals at the controller be greater than one 
clock cycle. The RFLAG signal ensures that valid data is received by the memory controller 40 
at all times, and as such, the memory controller 40 receives the data in response to the read flag 
signal RFLAG_M1, RFLAG_M2 and read clock signal RCLK_M1, RCLKJVT2 transferred from 
the module 42A, 42B. In the same manner, the memory devices 52 on the module 42A, 42B 
receive the data in response to the DFLAG and WCLK signals transferred form the controller 
42A, 42B. 

Accordingly, the memory controller 40 recognizes the read data DQ arrival time via the 
RFLAG signal output by the C/A buffer 46A. The RFLAG signal has the same propagation 
delay time as the read data DQ signals because the RFLAG signal is configured to have the same 
bus stub structure as the DQ signal Assuming that the memory controller 40 is able to 
compensate for a difference in phase between the DFLAG signal generated at the controller 40 
and a read data DQ transmitted by the memory modules 42, then there is no need for the separate 
RFLAG signal. Assuming such a need, the RFLAG signal can be generated, and routed in the 
same manner as the RCLK signals. Preferably, the RFLAG and DFLAG signals are routed with 
the control/address (C/A) bus signals, instead of with the data bus DQ signals, since, if the 
RFLAG and DFLAG signals were to be routed with the data bus DQ signals, then multiple 
RFLAG and DFLAG signals would be needed - at least one for each memory device 52. 
However, if the RFLAG and DFLAG signals are routed on the C/A bus, only one of each 
RFLAG and DFLAG is needed. It is further preferable that each C/A buffer 46 of each memory 
module 42A, 42B generate and transmit an independent RFLAG signal RFLAGJVQ, 
RFLAGJM2 to the memory controller 40 since respective path lengths from the modules 42A, 
42B generating the independent RFLAG signals to the memory controller 40 are different. 
Loading equalization can be achieved by including dummy capacitors or dummy pins 68A, 68B 
as described above. Similar pin out configurations can be achieved for the different modules by 
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crossing signal paths, for example at location 53 on the system bus 50, as explained above. 

Memory modules can be generally categorized as single-sided modules or double-sided 
modules; single-sided modules having memory devices on one face of the module, and double- 
sided modules having memory devices on both faces of the module. Loading of the data bus DQ 
and write clock line WCLK varies depending on the type of memory module. For example, 
loading of the WCLK signal is greater in a double-sided module as compared to that of a single- 
sided module, since the WCLK line is routed to two WCLK pins on the double-sided module - 
one for the front-side DRAM, and the other for the rear-side DRAM, while on a single-sided 
module only one connection is needed. In contrast, loading of the write clock WCLK input to 
the C/A buffer, and loading of the read clock signal RCLKJVQ, RCLK_M2 output from the 
C/A/ buffer are the same for the single-sided module and double-sided module, since the C/A 
buffer is typically mounted on only one side of the module. Depending on the type of module 
(single-sided/double-sided), a skew caused by the difference in loading of the DQ bus, WCLK 
line, and RCLKJVIl, RCLK_M2 may occur. To address this issue, dummy pins 72A, 72B can 
be provided to the C/A buffer as shown in FIG. 7. For the double-sided module, a second set of 
dummy contacts 72A, 72B are provided and connected to the WCLK/RCLK/RFLAG pins 70A, 
70B as shown. Alternatively, switchable capacitors 74 A, 74B may be provided in a C/A buffer 
as shown in FIG. 8, for the double-sided module, the capacitance value of the dummy capacitors 
74A, 74B being set to match the extra capacitance experienced by the DQ and WCLK lines at the 
second side of the module. In the case of the double-sided module, the DQ and WCLK lines 
connected to the DRAMs see two input pins, one at the front side and the other at the back side 
of the module, while the WCLK, RCLK, and RFLAG lines connected to the C/A buffer see only 
one input pin, since the C/A buffer is typically mounted on only a single side of the module. 
Therefore, a loading mismatch exists between the DQ/WCLK lines connected to the DRAM 
devices on both sides of the module and the WCLK/RCLK/RFLAG lines connected to the C/A 
buffer on one side of the module. To alleviate this loading mismatch, the C/A buffer 46 of the 
present invention provides dummy pins 72 A for the WCLK, RCLK and RFLAG lines, or 



alternatively, a dummy capacitor 74A, 74B for those lines. 
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With reference to FIG. 9, an alternative embodiment of the present invention is provided. 
In this embodiment, instead of generating the read clock RCLK based on, and in synchronization 
with, the received write clock WCLK, a separate read clock generator 76 is provided for 
generating a read clock signal RCLK that is distributed across the system bus 50 in a direction 
opposite the direction of propagation of the write clock WCLK. The read clock line RCLK is 
terminated at the memory controller 40. In this embodiment, the inherent problem of the stub 
bus architecture is apparent. With reference to FIG. 10, assume that the RCLK signal is 
generated at generator 76, and assume that data is to be read from a memory device located on 
module 42A. In this scenario, the RCLK signal is generated and propagates along the RCLK line 
directly to the memory controller 40. Memory module 42A receives the RCLK signal from the 
RCLK line following a time delay represented by the propagation of the RCLK signal along the 
path of dashed line 80A. The data to be read DQ are then released onto the DQ bus via the path 
represented by dashed line 80B. The read data DQ then continue to propagate toward the 
memory controller 40A. In this manner, the RCLK signal will be available at the controller 40 
prior to the read data DQ. In view of this, a phase difference exists between the RCLK signal 
and the DQ data, the phase difference being represented by the propagation delay along the stub 
signal lines represented by dashed lines 80A and 80B. Assuming this time skew to be small, or 
assuming that the controller can compensate for the time skew, the configuration of FIG. 9 may 
be applied to the memory system of the present invention. 

In this manner, according to the above embodiment of FIGs. 3 - 8, a single free-running 
clock is provided through the same path as data signals in a memory system having a stub bus 
configuration. A single clock domain is employed for both read and write operations. For both 
operations, the read or write clock signal is routed through the same transmission path as the 
data, thereby increasing system transfer rates by maximizing the window of data validity. In this 
manner, data bus utilization is increased due to the elimination of a need for a preamble interval 
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for the strobe signal, and pin count on the memory module connectors is reduced. 

While this invention has been particularly shown and described with references to 
preferred embodiments thereof, it will be understood by those skilled in the art that various 
changes in form and details may be made herein without departing from the spirit and scope of 
the invention as defined by the appended claims. 



